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ABSTRACT 

We present a family of robust tracer mass estimators to compute the enclosed mass of galaxy 
haloes from samples of discrete positional and kinematical data of tracers, such as halo stars, 
globular clusters and dwarf satellites. The data may be projected positions, distances, line 
of sight velocities or proper motions. The estimators all assume that the tracer population 
has a scale-free density and moves in a scale-free potential in the region of interest. The 
circumstances under which the boundary terms can be discarded and the estimator converges 
are derived. Forms of the estimator tailored for the Milky Way galaxy and for M3 1 are given. 
Monte Carlo simulations are used to quantify the uncertainty as a function of sample size. 

For the Milky Way galaxy, the satellite sample consists of 26 galaxies with line-of-sight 
velocities. We find that the mass oftheMilky Way within 300 kpc is M300 = 0.9±0.3x10'2Mq 
assuming velocity isotropy. However, the mass estimate is sensitive to the assumed anisotropy 
and could plausibly lie between 0.7 - 3.4 xIO'^Mq, if anisotropics implied by simulations or 
by the observations are used. Incorporating the proper motions of 6 Milky Way satellites into 
the dataset, we find M300 = 1.4 + 0.3 x IO'^Mq. The range here if plausible anisotropics are 
used is still broader, from 1.2 - 2.7 x lO^^M©. Note that our error bars only incorporate the 
statistical uncertainty. There are much greater uncertainties induced by velocity anisotropy 
and by selection of satellite members. 

For M31, there are 23 satellite galaxies with measured line-of-sight velocities, but only 
M33 and IC 10 have proper motions. We use the line of sight velocities and distances of the 
satellite galaxies to estimate the mass of M31 within 300 kpc as M300 = 1.4 + 0.4 x IO'^Mq 
assuming isotropy. There is only a modest dependence on anisotropy, with the mass varying 
between 1.3 - 1.6 x lO'^M©. Incorporating the proper motion dataset does not change the 
results significantly. Given the uncertainties, we conclude that the satellite data by themselves 
yield no reliable insights into which of the two galaxies is actually the more massive. 

Leo I has long been known to dominate mass estimates for the Milky Way due to its sub- 
stantial distance and line-of-sight velocity. We find that And XII and And XIV similarly dom- 
inate the estimated mass of M3 1 . As such, we repeat the calculations without these galaxies, 
in case they are not bound - although on the balance of the evidence, we favour their inclusion 
in mass calculations. 

Key words: galaxies: general - galaxies: haloes - galaxies: kinematics and dynamics - galax- 
ies: individual: M31 - dark matter 



1 INTRODUCTION 

The structure and extent of dark matter haloes have important im- 
plications for modem astrophysics, yet the determination of such 
properties is a difficult task and the results are often conflicting. A 
neat illustration is provided by the usage of Sagittarius Stream data 
to constrain the shape of the Milky Way dark halo. This has told 
us that the halo is nearly spherical jpellhauer et"Zll2006l) . prolate 
('Helmi 2004"), oblate jjohnston etd]|2005l) or triaxial (Law et al. 
[2009) in nature! The Milky Way is the closest halo available for 



our study, the availability of data has improved substantially in re- 
cent years, and yet we are not able to determine its shape reliably. 

Similarly, we are unable to measure the masses of the Milky 
Way, or its neighbour, the Andromeda Galaxy (M31) with any pre- 
cision. Despite their proximity to us, their masses remain sketchily 
determined and there is some controversy as to which halo is more 
massive. Judged by criteria such as the surface brightness of the 
stellar halo or the numbers of globular clusters or the amplitude of 
the gas rotation curve, M3 1 is seemingly the more massive. Judged 
by criteria such as the velocities of the satellite galaxies and distant 
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globulars or tidal radii of the dwarf splieroid als, then the Milky W ay 
is seemingly the more massive. For example. lEvans et"^ ( l200cl) ar- 
gued that the M3 1 halo is roughly as massive as that of the Milky 
Way, with the Milky Way marginally being the more massive of the 
two, while rece nt studies have found evidence favo uring both the 
Milky Way (e g.lEvans & W ilkinson 2000; Gottesm an et al.ll2002h 
and M31 (e.g. Klv pin etrij|2002l : IKarachentsev et alj|2009h as the 
more massive galaxy. 

The masses of both haloes within a few tens of kiloparsecs 
are reasonably well constrained by gas ro tation curve data (e.g. 
iRohlfs & Kreitschmamilll988l : lBraunlll99ll) . However, these data 



only sample the inner parts of the haloes. In order to probe fur- 
ther out, we must turn to the kinematics of the satellite popula- 
tions. Such tracers are a valuable tool for studying the dark matter 
haloes as their orbits contain important information about their host 
potential. Distance, radial velocity and proper motion data can be 
use d to constrain halo extent, mass and velocity anisotropy (see 
e.g.lLittle & Tremaine|[T987l : lKochanek|[l99^: IWilkinson & Evanj 
1 199^ 

The uncertainties in the mass estimates for the Milky Way and 
M3 1 are largely due to the fact that there is seldom proper motion 
data available to complement distance and radial velocity informa- 
tion. With only one velocity component to work with, the eccentric- 
ities of the orbits are poorly constrained. Statistical methods must 
be applied to determine masses and these methods suffer greatly 
from the small sample sizes available, even with the recent burst of 
satellite discoveries associated with both galaxies. 

The projected mas s estimator was introduced by 
iBahcall & Tremaind ( Il98lb . They assumed that only projected 
distance and line-of-sight velocity informatio n was availa ble. 
The estimator is also contained in the study of I White! ( Il98lh on 
sc ale-free ensembles o f binary galaxies. The an alysis was extended 
bv lHeisler et"al] ( 1 19851) and further modified bv lEvans et ai] ( l2003h 
to consider the case of tracer populations. These previous studies 
successfully used the mass estimator to weigh M31. However, in 
its present form, the mass estimator is ill-suited for application to 
the Milky Way and such a study has not yet been attempted. 

Here, we develop alternative forms of the estimator, and 
analyse the conditions under which they are valid. In addition, 
the census of satellites around M31 has increased significantly 



incantty 
alj2007l : 



jZucker et alj2()()4l2007l:lMartin et al.l2006l:lMaiewski et al. 

llbata et alj2007l : llrwin et alj2008l : lMcConnachie et al.l2008l) since 
the last studies of this type were attempted and so we have more 
data at our disposal. Hence, we apply our estimator to M31 with 
these new data. 



the case in which tracers may track the total mass (e.g. in galaxy 
groups). They found that C = 32/ji for particles with an isotropic 
velocity distribution and C = 64/jt for particles on radial orbits. A 
key assumption in this work is that the members/tracers track the 
mass of the group/host. This is not true for all tracer populations, 
particularly for those tracers which are commonly used to estimate 
the masses of ellipticals or the haloes of spiral galaxies. 

2.1 Tracer Mass Estimator 

Here, we give a formal derivation of our tracer estimators, so as to 
clarify the conditions under which they converge to the enclosed 
mass. Readers primarily interested in applications, and willing to 
take convergence on trust, should skip straight to the estimators 
themselves, namely eqns J16b . ( 123b . ( 124b and i26\ . We give for- 
mulae for the various cases in which true distances or projected 
distances, and line-of-sight velocities, or radial velocities or proper 
motions, are known for the tracers. The estimators are both simple 
and flexible. 

Let us begin by supposing that the observations are discrete 
positions r and radial velocities v,- of N members of a tracer popu- 
lation. Here, r is measured from the centre of the host galaxy, whilst 
vv = r is the radial velocity. We propose to combine the positional 
and kinematic data to give the enclosed mass M in the form 



M = -(vy) = -iyv>;'. 



(2) 



Here, unlike equation O, the constant C is not necessarily dimen- 
sionless. Notice that a priori we do not know the best choice for A. 
This will emerge from our analysis. 

If / is the phase space distribution function of the tracers and 
(Ti the radial velocity dispersion, we see that under the assumption 
of spherical symmetry: 



— fdVd^^/vV' = — f p<T^r''^'^ dr 



(3) 



(4) 



where M, is the mass in the tracers 

M, = 4jt r-p dr. 

Now, let us assume that the tracer population is spherically sym- 
metric and has a number density which falls off like a power-law 



p(r) oc r 



dlogp 
dlog r 



-y 



(5) 



2 MASS ESTIMATORS 



The projected mass estimator jBahcall & Tremainell98ll) takes the 
form 



C 

M = -( 
G 



vLR) 



La 



(1) 



for a set of A' tracers objects (e.g planetary nebulae, stars, globular 
clusters, dwarf spheroidal galaxy satellites) with line-of-sight ve- 
locities v'los and projected distances R. Here, G is the gravitational 
constant and C is a constant determined by the host potential and 
the eccentricity of the orbits. They found that C = 1 6/ it for test par- 
ticles with an isotropic velocity distribution orbiting a point mass 
and C = 32/ji for test particles moving on radial orbits. 

This analysis was extended bv .Heisler et al., (il985i) to consider 



at least within the radius interval [ri„, rant] where the data lie. Then, 
the estimator reduces to 



M 



3-7 
log(^) 



(7^3) 

(7 = 3) 
(6) 

where log x is the natural logarithm. Once the behaviour of is 
found, we may relate this estimator to the dynamical halo mass 
M(r). This can be achieved through solving the Jeans equation, 
which reads: 



1 d(p<) ^ 2j8< 



_ GM(r) 

p dr r r^ 

Here, we have introduced /3 = 



(7) 



1 - alia], the Binney anisotropy 
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parameter, in which cr, is the tangential velocity dispersion. Now, 
yS — » cx) corresponds to a circular orbit model, yS = 1 corresponds to 
purely radial orbits and /? = is the isotropic case. We note that the 
Jeans equation (|7l( in a spherical system can be put into the form 



Qpcf, = - I Qp d'- ; 



dr. 



(8) 



If fS is independent of r, this simplifies to be 2 = 

To proceed further, the underlying gravity field is assumed to 
be scale-free at least in the interval [ri„, rom], that is, the relative 
potential up to a constant is given by 
,,2 ^ 

(a + 0) 

(9) 

(a = 0) 



a\ri 
,v5log(^) 



with - \ < a < l^Here, a is a fiducial radius, which should lie in 
the region for which the power-law approximation for the relative 
potential is valid (i.e., ri„ < a < ro„t) and Vo is the circular speed 
at that radius a. When a = 1, this corresponds to the case in which 
the test particles are orbiting a point-mass; when a = 0, the satel- 
lites are moving in a large-scale mass distribution with a fiat rota- 
tion curve; when o- = y - 2, the satellites track the total gravitating 
mass. We remark that our model of a scale-free tracer population of 
satellites in a scale-free potential has previously been us ed to s tudy 
the mass of the Milky Way by Kules s a & Lvnden-Belil jl992h . al- 
though using the standard technique of maximum likelihood for 
parameter estimation. 

The scale-free assumption is also equivalent to proposing the 
halo mass profile to be 



(10) 



M(r) 
M(a) 

and the local mass density oc ^"("+2) Consequently, if the power-law 
behaviour were allowed to be extended to infinity, the total mass of 
the dark halo would necessarily be infinite unless a = I. (However, 
if the halo density were to fall off faster than r"' and so the total 
gravitating mass is finite, the leading term for the potential would 
be Keplerian. That is to say, for the case of a finite total mass halo, 
the gravity field experienced by the tracers may be approximated 
to be that of a point mass, given that r,„ is chosen to be sufficiently 
large so that the gravitating mass inside the sphere of rj,, dominates 
the mass within the shell region populated by the tracers.) 

Combining this with the constant-anisotropy assumption, the 
Jeans equation integrated between r and r„^i then reduces to 



r2/5-r, 



fr?(r)- 



GM(a) r™ 



(11) 



provided that all our assumptions remain valid in the radius interval 
hn, '"oui] and r,ae [ri„, r^ui]- 

Now, our goal is to find the total halo mass. In reality, the 
observed tracers are only populated up to a finite outer radius, and 
so, any mass distribution outside of that radius does not affect our 
observations in a strictly spherical system (Newton's theorem). We 
therefore extend the power-law potential assumption only up to the 



' Q! = -1 corresponds to the gravitational field that pulls with an equal 
magnitude force regardless of radius, which is formally generated by a halo 
density falling off as r"'. Provided we regard the scale-free potentiaf as 
an approximation valid over a limited range and not extending to spatiaf 
infinity, we can permit a > -2, since a = -2 corresponds to the harmonic 
potential generated by a homogeneous sphere. 



finite outer radius (here raud, and set a = raut- In other words, the 
halo mass that we are interested in is that contained within the outer 
radius, M = M{raut)- With a — Tout) solving equation < 11 11 1 for cr2(r) 
results in (here s = r/raud 



o",-(''out) - V(, v; 



cr^(rout) - vl log ^ 



(12) 



(tt = 2/3-7) 



where = GM/ra^t is the circular speed at r^ui whilst Vg = v^/(a + 
J -213). 

Then, substituting the result of equation I ll2t into equation ^ 
and explicitly performing the integration yields (ignoring particular 
parameter combinations that involve the logarithm) 



ivy) J- 

(3-y)C {A-a + 3-y)(a + y-2/3) 1 



1 



^ - 2/3 + 3 



cr, (roi,,) - 



or + 7 - 2/3 



1 - 

1 - u^-y 



(13) 



where u = rm/r„^,. Notice now that the choice of A = a makes the 
w-dependence of the first term in the right-hand side drop out. In 
fact, this could also have been deduced on dimensional grounds by 
requiring that our estimator is not dominated by datapoints at small 
radii or large radii. 

The last terms in equation il3\ basically constitute the surface 
'pressure' support terms in the Jeans equation, which we wish to 
minimize as u — > 0. Here, we limit ourselves to the case that A = a, 
when the corresponding leading term is 



1 _ ^-2li+3 

1 - u^-y 



—u 

-uy-^ 

„o-+y-2/J 



2/3 - c, 7 < 3 
7 < 3 < 2/3 - a 
2/3 - Q- < 3 < 7 ' 
3 < 2/3 - a, 7 



(14) 



In other words, provided that 7 > 3 and y > 2p — a, the pressure 
term va nishes as m ^ 0, a nd we obtain the scale-free Jeans solu- 
tions of lEvans et al.l ( Il997l) . In fact, since /3 < 1 and -1 < ff < 1, 
we find that 2/3 - a < 3 and thus the second condition here is es- 
sentially redundant. Consequently, provided that 7 > 3, that is the 
tracer density falls off more quickly than r"^, we find the estimator 
to be 



GM 



a + y-2p r„M 



(15) 



where the remainder 7? — > vanishes as r-mlt'ont — > (here, r\n and 
Toui are the inner and outer radius of the tracer population). 

Alternatively, if 7 < 3 and 21} - a < 3, the remainder term 
tends to a constant as u — » 0. In a perfectly scale-free halo traced 
by again strictly scale-free populations, this constant must be zero. 
This is because, for such a system, cr^ should also be scale-free. Yet 
equation J12t implies that this is possible only if iT^(rout) = Vq- Sub- 
sequently this also indicates that the coefficient for the remainder 
in equation l |13t vanishes too. Even after relaxing the everywhere 
strict power-law behaviour, we would expect that a] ~ and con- 
sequently that — Vgl <K Vq, provided that 2(3 — a < y, which is 
required to ensure > 0. That is to say, we expect that v^r^^, » K 
as M — > in equation ( I15t for 2/3 - a < 7 < 3, which is sufficient 
for justifying the applicability of our mass estimator. 

In other words, we have obtained a very simple result 



C = (a + y-2p) rl 



(16) 
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provided that C > (the simple interpolative argument indicates 
that this is still valid for 7 = 3). This corresponds to the case in 
which the tracers have known radial velocity components v, re- 
solved with respect to the centre of the galaxy, as well as actual 
distances r. For satellites of the Milky Way, the line of sight veloc- 
ity Vios is measured, and corrected to the Galactic rest frame. Now, 
V,. may be calculated from vi^s only if proper motion data exists. 
Alternatively, a statistical correction can be applied to estimate v, 
from v'los 



(17) 



1 - /? sin ifi 

where (f is the angle between the unit vector from the Galactic Cen- 
tre to the satellite and the unit vector from the Sun to the satellite. 

Note too that in the important isothermal case (a = 0), the 
galaxy rotation curve is flat with amplitude vq. Then, for members 
of a population with density falling like p ~ r"', such as the Galac- 
tic globular clusters, eqn 1 II6I 1 reduces to 

4 = (3 - 2/?)<v?). (18) 

This i s a generalization of the estimator of iLvnden-Bell & FrenkI 
( Il98ll) to the case of anisotropy. When the population is isotropic 
(fi = 0), it reduces to the appealing simple statement that the cir- 
cular speed is the rms velocity of the tracers multiplied by V3 a 
1.732. 

Even if three dimensional distance r is replaced by projected 
distance R or vv by some other projections of the velocity, the ba- 
sic scaling result of eqn il6\ remains valid. Different projections 
simply result in distinct constants C, as we now show. 

2.2 A Family of Estimators 

Now, suppose that we have actual distances r from the centre of the 
host galaxy, but only projected or line of sight velocities V|os. This 
is the case for many of M3 1 's satellite galaxies, for which distances 
have been measured by using the tip of the red giant branch method 
(see e.g.. iMcConnachie et alJllOOSh and for which projected ve- 
locities are known from spectroscopy. The calculation proceeds by 
considering 



i-/dVdVvL'-^ = ^X'""d.£d.po-L. 



(19) 

We now need the relationship between line-of-sight velocity dis- 
persion cTios and the radial velocity dispersion cTi , namely 

<rL=(r;{l-Psm^,p), (20) 

which is similar to equation ( I17t but here the angle ip is the 
angle between the line of sight and the position vector of the 
satellite with respect to t he centre of the host galaxy (see e.g., 
iBinnev & Tremaine|[T987l. Section 4.2). If the polar z-axis of the 
coordinate system is chosen such that the sun (that is, the observer) 
lies on the negative z-axis (i.e., 9 = Jt) at a distance d from the 
centre of the host galaxy, we find that 



sm~(p ■■ 



sin' 6 



1 +2^cos^ 



2 ■ 



(21) 



However, for most external galaxies, it is reasonable to assume d » 
rout, and therefore, we can safely approximat^ that sin^ (p a sin^ 6. 

^ On the other hand, for the satellites of the Milky Way, it is often assumed 
that d <K Tin, which leads to sin^ » and consequently (v^^^r") a {v^r"). 



Then, 



d0 sin6)(l -/?sin-e), 





and thus we find that 



_ 3(ft-Hy-2/j) 



(22) 



(23) 



3-2/3 

Next, we consider the case in which we have full velocity in- 
formation for the satellites, i.e., both radial velocities and proper 
motions. For example, this is the case for a s ubset of the satellites 
of the Milky Way (see e.g.. |Piatek et alj200 2'). In this case, we can 
utilize tr^ = cr^ + cr^ = (3 - 2P)cr^, and therefore the estimator 
becomes 



C 



C 



a + j-2p 

" ''nut ■ 



(24) 



3-2/? 

Finally, we can assume a worst-case scenario in which the 
only data available are projected distances R and line-of-sight ve- 
locities vios for the tracers. Outside of the galaxies of the Local 
Group, this is the usual state of affairs. So, this would be the form of 
the estimator to find the dark matter mass of nearby giant ellipticals 
like M87 from positions and velocities of the globular clusters. The 
estimator is derived following the same procedure with R = rsin 9, 
which results in the relation 

{vlX} = (^y} J d6) sin"+'(9(l -/Jsin^e). (25) 
Consequently, the corresponding estimator is found to be[^ 



(a + y- 2p) 



I. 



aJ3 



where 



ii'/2r(f + 1) 
4r(f + f ) 



[a + 3 - /3(a + 2)] 



(26) 



(27) 



and r(.i:) is the gamma function. This case is related to work by 
Bahcall & Tremaine (1981). So, for example, in the Keplerian case 
(a = 1), a distribution of test particles with y = 3 gives 



C 



32 2-p 
Jt 4- 3/3 



(28) 



When p = Q, this implies that C = 16/jt; whilst when /3 = 1, 
C = 32/11. 

Some of these estimators are implicit in othe r work. In par - 
ticular, some are equivalent to those introduced by I White! l ll98lh . 
who had a different focus on the dynamics of binary galaxies but 
who made the same scale-free assumptions to obtain robust mass 
estimators. Very recently. An & Evans (2010, in preparation) found 
a related family of estimators that are independent of parameters 
derived from the tracer density (like 7). 



3 CHECKS WITH MONTE CARLO SIMULATIONS 

In order to verify the correctness of our mass estimators, we gener- 
ate synthetic data-sets of anisotropic spherical tracer populations. 



^ The result is valid provided that the integral is limited to spherical shells. 
However, given the lack of depth information, it might seem more logical to 
perform the integration over cylindrical shells. Unfortunately, the result is 
more complicated, as it involves the integrals of incomplete beta functions. 
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Figure 1. Distribution of mass estimate as a fraction of tlie true mass for 1000 Monte Carlo realisations, assuming that parameters a, j} and y are known 
exactly. Left: N = W, 000. Middle: N = 100. Right: N = 30. The number of satellites in the simulation and the form of the estimator used to recover the mass 
is shown in the top left comer of each panel. A best-fit Gaussian is plotted for each distribution and the standard deviation of the distribution is shown in the 
top right comer of each panel. On average, the tracer mass estimator recovers the tme mass of the host. [The cases shown correspond to £!■ = 0.55. /3 = 0.0 and 

y = 2.7]. 



Distances r are selected in [r]„, roui] assuming thie power-law den- 
sity profile in eqn l|5j. Projection directions are determined by the 
position angles: cos 9 is generated uniformly in [- 1, 1] and is gen- 
erated uniformly in [0, 2ii]. If R lies outside of the allowed range, 
the projection direction is regenerated until R is within [i?in,^out]- 

The phase-space distri bution fu nctions that give rise to such 
density profiles are given in lEvans e t al. 1.1997.') . Tracer velocities 
are picked from the distributions 



/(v) ' 



2|[2y-3«-2/3(2-ff)]/(2ff) 
2^ I 



(a + 0) 



(Qr = 0) 



(29) 



For a > 0, the maximum velocity at any position is ^2<j/{r)\ 
for g < 0, the velo c ities c an become arbitrarily large. Following 
Isirmev & Tremain3 ( Il987h . we introduce spherical polar coordi- 
nates in velocity space (v, ^, 77) so that the velocities resolved in 
spherical polar coordinates with respect to the centre are then 



V cos 77 I'fl = v sin T] cos ^ = v sin rj sin <f 



(30) 



To generate velocities with the correct anisotropy, <f is generated 
uniformly in [0, 2ji] and rj is picked in [0, n] from the distribution 



F(77) oc Isinzyl 



(31) 



where p is the Binney anisotropy parameter. Finally, the line-of- 
sight velocities are calculated and used in the tracer mass estimator. 

Figure [T] shows the distribution of mass estimates as fractions 
of the true mass for 1000 realisations, assuming that parameters a, 
p and 7 are known exactly; the left panels show simulations with 
10,000 tracers, the middle panels for 100 tracers and the right pan- 
els for 30 tracers. The panels use the different forms of the estima- 
tor given in eqns il6\ . i23\ . i24\ and l l26t respectively. A Gaussian 
with the same standard deviation as each distribution is also plotted 
for each panel. The standard deviation is included in the top-right 
comer of each plot and gives an estimate of the error in each case. 

We see that our mass estimators are unbiased - that is, on av- 
erage, the true mass is recovered in all cases. The benefit of using 
three dimensional distances r instead of projected distances R is 
modest, as is the improvement gained by using vv in place of \%s- 
However, if proper motion data are available, then using v instead 
of v'r gives a more accurate mass estimate. 

So far, we have assumed that we know a, p and 7 exactly, 
which is, of course, not the case. Our estimates for a, p and 7 
have errors associated with them, not least because the notion of 
a scale-free density profile in a scale-free potential is an idealiza- 
tion. As these parameters enter the estimator through the prefactor 
C, it is straightforward to obtain the additional uncertainty in the fi- 
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nal answer using propagation of errors. As we will show in the next 
section a and 7 are constrained either by cosmological arguments 
or by the data. The right-most column in Figure [T] (a host with 30 
satellites) is the most applicable to our data-sets at present as the 
Milky Way has 26 satellites and M3 1 23 satellites with a recorded 
line-of-sight velocity. The error on the mass estimate obtained in 
this case is ~ 25%. This is much larger than that the effects of er- 
rors on a and 7 and so the latter will be ignored for the rest of the 
discussion. 

However, the case of the velocity anisotropy /3 is different as 
it is poorly constrained, with theory and data pointing in rather dif- 
ferent directions. Changes in p can therefore make a substantial 
difference to the mass estimate. 

Note that these simulations yield no insight into systematic er- 
rors, because the mock data are drawn from the same distribution 
functions used to derive the form of the mass estimators. This is a 
concern as there are a number of causes of systematic error - for 
example, dark halos may not be spherical, or infall may continue 
to the present day so that the observed satellites may not necessar- 
ily be virialized. Deason et al. (2010, in preparation) have tested 
the estimators derived in this paper, as well as a number of other 
commonly used mass estimators, against simulations. Specifically, 
they extracted samples of Milky Way-like galaxies and their satel- 
lites from the Galaxies Intergalactic Medium Interaction Calcula- 
tion ( ICrainetal.ll200^ , a recent high resolution hydrodynamical 
simulation of a large volume of the Universe. They find that the es- 
timators in t his paper significantly out-p erform the projected mass 
estimator of Bahcall & Tremaing ( Il98lh and the tracer mass esti- 



Table 1. Data table for ttie satellites of tlie Milky Way. Listed are Galactic 
coordinates (/, h) in degrees, Galactocentric distance r in kpc and corrected 
line-of-sight velocity in km s"' . 



mator of lEvans et al. U2OO3I) . 





/ 

^.^eg; 


h 

^.^eg; 




(km s"') 




Bootes I 


358.1 


69.6 


57 


106.6 


1,2 


Bootes II 


353.8 


68.8 


43 


-115.6 


3,4 


Canes Venatici I 


74.3 


79.8 


219 


76.8 


5,6 


Canes Venatici II 


113.6 


82.7 


150 


-96.1 


6,7 


Carina 


260.1 


-22.2 


102 


14.3 


8,9 


Coma Bemices 


241.9 


83.6 


45 


82.6 


6,7 


Draco 


86.4 


34.7 


92 


-104.0 


8,10,11 


Fornax 


237.3 


-65.6 


140 


-33.6 


8,12,13 


Hercules 


28.7 


36.9 


141 


142.9 


6,7 


LMC 


280.5 


-32.9 


49 


73.8 


8,14,15 


Leo I 


226.0 


49.1 


257 


179.0 


8,16,17 


Leo II 


220.2 


67.2 


235 


26.5 


8,18,19 


Leo IV 


265.4 


56.5 


154 


13.9 


6,7 


Leo T 


214.9 


43.7 


422 


-56.0 


6,20 


Leo V 


261.9 


58.5 


175 


62.3 


21 


SMC 


302.8 


-44.3 


60 


9.0 


8,22,23 


Sagittarius 


5.6 


-14.1 


16 


166.3 


8,24 


Sculptor 


287.5 


-83.2 


87 


77.6 


8,25,26 


Segue 1 


220.5 


50.4 


28 


113.5 


3,27 


Segue 2 


149.4 


-38.1 


41 


39.7 


28 


Sextans 


243.5 


42.3 


89 


78.2 


8,9,29 


Ursa Major I 


159.4 


54.4 


101 


-8.8 


3,6 


Ursa Major II 


152.5 


37.4 


36 


-36.5 


6,30 


Ursa Minor 


104.9 


44.8 


77 


-89.8 


8,10,11 


Willman 1 


158.6 


56.8 


42 


33.7 


2,3 



4 MASS ESTIMATES FOR ANDROMEDA AND THE 
MILKY WAY 

4.1 Choice of Power-Law Index Parameters 

We now apply the mass estimators to the Milky Way and M31, 
the two largest galaxies in the Local Group. In converting he- 
liocentric quantities to Galactocentric ones, we assume a circu- 
lar speed of 220 km s^'at the Galactocentric radius of the sun 
{Rq, = 8.0 kpc) and a solar peculiar velocity of {U, V, W) = 
(10.00,5.25,7.17) km s"', where U is directed inward to the Galac- 
tic Centre, V is positive in the direction of Galactic rotation at the 
position of the sun, and W is positive towards the North Galactic 
Pole (see e.g., .Dehnen & Binnev.199 8). 

For the M31 satellites, positional and velocity data must be 
computed relative to M31 itself. We take the position of M31 to be 
{{, b) = (121.2°, -2L6°) at a distance of 785 kpc and its line-of- 
sight velocity to be -123 km s^'in the Galactic rest frame (see e.g., 
McConnachie et al. 2005; McConnachie & Irwin 2006). 

In order to apply our estimators to these systems, we need to 
compute the power-law index of the host potential a, the velocity 
anisotropy p and the power-law index of the satellite density dis- 
tribution y. There are cosmological arguments suggesting that the 
potentials of dark haloe s are well-approxima ted by Navarro-Frenk- 
White (NFW) profiles jNavarro et^l 19961) . Figure [2] shows the 
best-fit power-law to the NFW potential for a wide range of con- 
centrations and virial radii. The fitting is performed in the region 
10 < r/ kpc < 300, which i s where the majority of the satellites 
lie. Now, lavpin etaljj2002h argued that the concentrations of the 
Milky Way and M31 are c ~ 12, whilst the virial radii fvi, are in 
the range 250-300 kpc. In other words, for the range of concentra- 
tions and virial radii appropriate to galaxies like the Milky Way and 



Sources: 1 - iBelokuroy et alj <2q06ll. 2 
Koch et al.' 



Martin et al.' CTOgD, 4 

Simon & Geha (2 00?!) , 7 -iBelo kurov et a lj <200' 
12004 ). 9 - IMateol Il998h . 10 - LB ellazzini et al. 



- iMartin et"^ 
5 - JZucker et al 
1 



l2007h 3 - 
j2006l). 6 - 



Karachentsev et alj 

. . ^ .' hmi), 11 - 

Arma ndroffet alj h995l), 12 - ISaviane e t al. (2000), 13 - Walk er^et alj 
(2006), 14 - Freedman et al. (2001), 15 - van der Marel et al. (2002), 16 
- Bellazzini et al. (2004), 17 - Kochetal. (2007I). 18 - iBellazzini et all 

( 2005) , 19 - iKoch et al. ( 2007), 20 - llrwin et al. ( 2007), 21 - Beloku rov eTai] 

(2006) , 22 - ICioni et a l. ( 26o9|), 23 - "Harris & Zaritsky (200^, 24 - 
llbata et alj il997l), 25 -iKaluznv et al. ( 1995), 26 - Oueloz et al. ( 1995), 27 
- iGeha et aljj2009l). 28 - IBelokurov et aL >2009i) . 29 - iWalker et al.i >200^ . 
30- IZuckeretalJj2006l) 

M31, we see - fortunately - that the surface in Figure[2]is slowly- 
changing and flattish with a x 0.55. 

If the satellite number density distribution nir) follows a 
power law with index 7, then the number of satellites within any 
radius, A'(< r), also follows a power-law with index 3-7. We fit 
power-laws to the Milky Way and M31 satellite cumulative distri- 
butions in order to estimate 7. We restrict ourselves to the inner re- 
gions of the satellite distributions, r < 300 kpc; beyond this range, 
the satellite population is likely to be seriously incomplete. The 
distributions and the best-fitting power laws are shown in Figure[3l 
the Milky Way data is shown in the upper panel and M3 1 data is 
shown in the lower panel. We find 7 = 2.6 for the Milky Way and 
7 = 2.1 for M31. Note th at data from the Sloan Digital Sky Survey 
(SDSS; l^rk et al.l2000l) has been instrumental in the identification 
of many of the recently-discovered Milky Way dwarfs. The SDSS 
coverage includes only the region around the North Galactic Cap, 
and, as such, the distribution of known Milky Way satellites is con- 
centrated in that area of the sky. However, given our underlying as- 
sumption that the distribution of satellites is spherically symmetric. 
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Figure 2. The best-fit value of the power-law index a to an NFW profile 
as a function of the concentration and virial radius. Note that for plausible 
values of the concentration c and the virial radius rvir for galaxies like the 
Milky Way and M31, (i- lies in the range 0.5-0.6. The suiface is smooth and 
flattish, implying that a is reasonably insensitive to the details of the NFW 
potential. 
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r (kpc) 

Figure 3. Cumulative numbers of satellite A'(< r) for the Milky Way (upper) 
and M31 (lower). The best-fit power laws in the range r < 300 kpc are 
also plotted. The index of these power-law fits may be used to estimate the 
power-law index of the satellite density distribution n(r). 



Table 2. Data table for the satellites of M3 1 . Listed are Galactic coordinates 
(/, b) in degrees, actual distance r from the centre of M31 in kpc, projected 
distance R from the centre of M3 1 in kpc and corrected line-of-sight veloc- 
ity in km s"'. 





/ 














^ciegj 


IQcgJ 






(km s^^) 




M33 


133.6 


-31.3 


809 


206 


74 


1,2 


M32 


121.1 


-22.0 


785 


5 


95 


2,3 


IC 10 


119.0 


-3.3 


660 


261 


-29 


2,3,4 


NGC 205 


120.7 


-21.1 


824 


39 


58 


1,2 


NGC 185 


120.8 


-14.5 


616 


189 


106 


1,2 


IC 1613 


129.8 


-60.6 


715 


510 


-56 


2,3,5 


NGC 147 


119.8 


-14.2 


675 


144 


117 


1,2 


Pegasus 


94.8 


-43.6 


919 


473 


85 


1,2 


Pisces 


126.7 


-40.9 


769 


268 


-37 


1,2 


Andl 


121.7 


-24.8 


745 


59 


-84 


1,2 


And 11 


128.9 


-29.2 


652 


185 


83 


1,2 


And 111 


119.4 


-26.3 


749 


75 


-57 


1,2 


AndV 


126.2 


-15.1 


774 


109 


-107 


1,2 


And VI 


106.0 


-36.3 


775 


267 


-64 


1,2 


And VII 


109.5 


-9.9 


763 


218 


21 


1,2 


And IX 


123.2 


-19.7 


765 


41 


94 


1,6,7 


AndX 


125.8 


-18.0 


702 


110 


130 


8,9 


And XI 


121.7 


-29.1 


785 


102 


-140 


7,10 


And XII 


122.0 


-28.5 


830 


107 


-268 


7,10,11 


And XllI 


123.0 


-29.9 


785 


115 


64 


7,10 


And XIV 


123.0 


-33.2 


740 


161 


-204 


12 


And XV 


127.9 


-24.5 


770 


94 


-57 


13,14 


And XVI 


124.9 


-30.5 


525 


280 


-106 


13,14 


And XVII 


120.2 


-18.5 


794 


45 




15 


And XVIII 


113.9 


-16.9 


1355 


589 




16 


And XIX 


115.6 


-27.4 


933 


187 




16 


And XX 


112.9 


-26.9 


802 


128 




16 


And XXI 


111.9 


-19.2 


859 


148 




17 


And XXII 


132.6 


-34.1 


794 


220 




17 



So urces: 1 -i McConnachie et al.' ('2005'), 2 -'McConnachie & Irwin' /iooS), 
3 -i Karachents ev et al. (2004), 4 - Sakai et al. ( 1999), 5 - Cole et al. ( 1993 
6 - Zu cker et alj j2004l) . 7 - Collins et al. (2009. in prep). 8 - IZucker etal] 
C2007,), 9 - Ka l irai et al. ( 2009, in prep), 10 - iMifftin et alj j2006), 11 - 

oi^ 



13 



Ibata et al 



2007h, 



U 

IChanman etall j2007l). 1 2 - iMaiewsk i et al. (20oi„ 

14 -iLetarte et al.l j2009t) 15 -llrwin et aU ( i2008l) . 16 - IMcConnachie et alj 
l2O08h . 17 - iMartin et aljiioosl) 



this directional bias does not impair our mass estimators. A big- 
ger worry may be the incompleteness in the satellite distribution, 
which could affect the power index for the tracer number density if 
the directional incompleteness varies in different distances. 

Finally, there are a number of possibilities for the veloc- 
ity anisotropy for the satellite galaxies. Previous studies often as- 
sumed isotropy, arguin g that there is no com pelling evidence to 
the contrary. However. lOiemand et alj ( IiOOtI) found that the ve- 
locity anisotropy of satellites in simulations behaves like p(r) ^ 
0.55(r/rvir)"' for 0.2r^,, < r < r^,,. To estimate j3 for the Milky 
Way and M31 satellites, we calculate the weighted mean of this 
distribution 



(32) 



where the weighting function n(r) is the satellite number density 
distribution. This gives ^ = 0.44 for the Milky Way and yS = 0.45 
for M31. This is similar to the anisotropy of halo stars (J5 = 0.37) 
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tongentiol orbits 




tangential orbits ^ 



radiQl orbits 



Figure 4. The sensitivity of the estimated mass on the anisotropy parameter 
yS for a satellite population with a = 0.55, /? = and y = 2.1 . The figure 
shows the mass recovered using the input values of a and y and varying the 
value of y8. The functional form of the curve is easy to deduce. It is a rational 
function of yS for the upper panel, which uses the estimator of eqn j23t , and 
a linear function of /? for the lower panel, which uses eqn (16). 



in simulations reported by IXue et alJ Jiooi) . Even though these 
numbers have the backing of simulations, they are somewhat sur- 
prising. Most of the Milky Way satellites with measured proper 
motions are moving on polar or tangential orbits. Using the sample 
of the 7 Milky Way satellites with proper motions, we can compute 
the radial and tangential components of the Galactocentric velocity. 
From these, the observed anisotropy /? — 4.5, which favours tan- 
gential orbi ts. This is consisten t with the earlier, though indirect, 
estimate of I Wilkinson & Evanj ( Il999h . who found /? — 1, again 
favouring tangential orbits. The origin of this discrepancy between 
simulations and data is not well understood. Perhaps there is con- 
siderable cosmic scatter in the anisotropy of the satellites, as it may 
depend on the details of the accretion history of the host galaxy. 
Figure |4] plots brings both good news and bad. The upper panel 
shows that the mass estimates for external galaxies using the line 
of sight estimator of eqn l |23l l are reasonably insensitive to the pre- 
cise value of p. This make sense, as for a galaxy like M31, the 
line of sight velocity encodes information on both the radial and 
tangential velocity components referred to the MSl's centre. How- 
ever, in the case of the Milky Way, the situation is very different. 
The measured velocities provide information almost wholly on the 
radial component referred to the Galactic Center. In the absences 
of proper motions, the velocity anisotropy is largely unconstrained 
by the data. This is the classical mass-anisotropy degeneracy, and 
so - as the lower panel shows - there is considerable uncertainty in 
the mass estimates inferred using eqn il6t . 



10 



5 - 



10 



E 5 



Hercules 



MW- 



Leo I 



M31- 



And XIV 



And XII 



0.00 0.05 0.10 0.15 0.20 0.25 0.30 
fractional mass contribution 

Figure 5. The fractional contribution each satellite makes to the mean mass 
estimator for the Milky Way (top) and M31 (bottom). For both galaxies, the 
mass budget is dominated by two satellites. For the Milky Way these are 
Leo I (red, dotted) and Hercules (blue, dashed). For M31, these are And 
XII (red, dotted) and And XIV (blue, dashed). 



In what follows, we typically quote mass estimates for the 
anisotropies derived both from observations jSdaia and from simu- 
lations jSsira, as well as for the case of isotropy (fi = 0). In the ab- 
sence of consistent indications to the contrary, our preference is to 
assume isotropy and to give greatest credence to the mass estimates 
obtained with this assumption. 



4.2 Radial Velocity Datasets 

Armed with values for a, /3 and y, we now set the mass estima- 
tors to work. Data for the satellites of the Milky Way and M3 1 are 
given in Tables [T] and [2] respectively. Objects for which no line- 
of-sight velocity has been measured (And XVII, And XVIII, And 
XIX, And XX, And XXI and And XXII) are included in the tables, 
but excluded from the analysis. 

Using eqn ( I16l > and recalling that the Monte Carlo simulations 
gave errors of ~ 25%, we give estimates of the mass with 100, 
200 and 300 kpc for the Milky Way Galaxy in Table [3] Assum- 
ing velocity isotropy, we obtain for the mass of the Milky Way 
M300 = 0.9 ± 0.3 X IO'^Mq. The cussedness of the mass-anisotropy 
degeneracy is well illustrated by the fact that using the observa- 
tionally derived ySdata gives M300 = 3.4 ± 0.9 x IO'^Mq, whilst using 
that from simulations gives M300 = 0.6 ± 0.2 x IO'^Mq. The huge 
spread in mass estimates is due to the fact that the line-of-sight ve- 
locities for the satellites are almost entirely providing information 
on the radial velocities as judged from the Galactic Centre. There 
is almost no information on the tangential motions in our dataset. 
However, there are other astrophysical reasons why masses higher 
than ~ 2 x 10'^Mq are disfavoured. 

Using eqn l |23t , we obtain the mass of M31 within 300 kpc as 
M300 = 1.4± 0.4x IO'-Mq. Here, though, in sharp distinction to the 
case of the Milky Way, plausible changes in the velocity anisotropy 
generate modest changes of the order of 10 per cent in the mass 
estimate, as shown in Table [3] Of course, this is understandable, 
as the line-of-sight velocity now has information on both the radial 
and tangential components, albeit tangled up in the projection. 

Taking the masses derived using velocity isotropy (fi = 0), we 
note that this work hints at the removal of a long-standing puzzle, 
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Table 3. Enclosed mass within 100, 200 and 300 kpc for the Milky Way and Andromeda galaxies. We offer three estimates: one using the anisotropy inferred 
from data (fi — 4.5), one assuming isotropy (,6 = 0) and the third with the anisotropy derived from simulations (f! ~ 0.45). 



Galaxy M300 (x10"Mo) M200 (x10"Mo) Mioo(x10"Mo) 







isotropic 


8 

A-* Sim 






isotropic 


8 

A-* Sim 


Bu, 


i 


isotropic 


B 


Milky Way 


34.2 ± 9.3 


9.2 ± 2.5 


6.6 ± 1.8 


21 


1 ±5.7 


5.5 ± 1.6 


3.8 ± 1.0 


13.9 ± 


4.9 


3.3 ± 1.1 


2.1 ±0.7 


... excl Leo 1 


25.2 ± 7.5 


6.9 ±1.8 


5.0 ± 1.2 


















... excl Leo 1, Her 


21.1 ± 6.3 


5.8 ± 1.5 


4.2 ± 1.1 


17 


5 ±5.3 


4.6 ± 1.4 


3.2 ±0.8 










MW with PMs 


38.6 ±7.0 


24.1 ± 5.3 


22.1 ± 5.3 


28 


3 ±5.5 


18.5 ±4.2 


17.0 ±4.2 


22.2 ± 


5.0 


13.8 ± 3.9 


11.4±3.1 


... excl Draco 


27.1 ±4.9 


14.1 ± 3.1 


12.2 ± 2.7 


18 


1 ± 3.4 


10.0 ± 2.3 


8.7 ±2.2 


12.9 ± 


3.1 


6.9 ± 1.9 


5.4 ± 1.5 


...exclLMC/SMC 


38.8 ± 6.8 


24.6 ± 5.8 


21.7 ±4.9 


28 


3 ±5.7 


18.4 ±4.3 


16.3 ±4.1 


22.7 ± 


5.6 


13.9 ±4.2 


11.2 ±3.4 


... excl Draco, LMC/SMC 


25.9 ± 5.1 


12.4 ±2.9 


10.6 ±2.5 


17 


0±3.5 


8.5 ± 2.2 


7.2 ± 1.8 


11.3 ± 


2.9 


5.4 ± 1.7 


4.2 ± 1.3 


M31 


15.8 ± 3.3 


14.1 ±4.1 


13.1 ± 3.8 


15 


4 ±4.1 


12.4 ±3.8 


10.6 ±3.5 


2.6 ± 


1.0 


2.1 ± 1.0 


1.8 ± 1.0 


...excl AndXlI 


12.2 ± 2.7 


10.9 ± 3.1 


10.1 ± 3.2 


11 


4 ±3.2 


9.2 ± 3.1 


7.9 ±2.8 










... excl AndXll, AndXlV 


9.6 ± 2.1 


8.5 ± 2.4 


8.0 ± 2.4 


8. 


6 ±2.6 


6.9 ±2.4 


5.9 ±2.2 










M31 with PMs 


15.1 ± 3.8 


13.9 ± 3.5 


13.1 ± 3.5 



















namely that the kinematic data on the satellite galaxies suggested 
that M3 1 was less massive than the Milky Way, whereas other indi- 
cators (such as the total numbers of globular clusters or the ampli- 
tude of the gas rotation curve) suggested the reverse. In fact, with 
the new datasets, the ratio of the masses of M31 to the Milky Way 
(~ 1.5) is close to that which would be inferred using the Tully- 
Fisher relationship and the assumption that the luminosity is pro- 
portional to the total mass (250'*/220'* ~ 1.67). If instead the radial 
anisotropics derived from simulations are preferred, then the ratio 
is - 1.98. 

However, it may be imprudent to include all the satellites. For 
example, Leo I has long been known to dominate mass estimates 
of the Milky Way, on account of its large distance (~ 260 kpc) and 
high line-of-sight velocity (see e.g. Kulessa & Lvnden-Belllll993 : 



iKochanekll 1 99d I Wilkinson & EvanslIT999ir It is unclear that Leo I 
is actually on a bound orbit, as opposed to a hyperbolic one. Hence, 
many attempts at determining the mass of the Milky Way quote es- 
timates both including and excluding Leo I. In fact, recent photo- 
metric and spectroscopic evidence presented bv lSohn et al.l l l2007l) 
favours the picture in which Leo I is bound on an orbit with high 
eccentricity (~ 0.95) and small perigalacticon (10-15 kpc). In par- 
ticular, such models give good matches to the surface density and 
radial velocity dispersion profiles of Le o I, and imply hig h mass 
estimates for the Milky Way. However, ISales et ai] ( |2007|) using 
simulations found a population of satellite galaxies on extreme or- 
bits ejected from haloes as a result of three-body slingshot effects, 
and suggested that Leo I might be an example of such an object. 
So, although the present evidence favours a bound orbit, a defini- 
tive verdict must await the measurement of Leo I's proper motion 
by the Gaia satellite, which should resolve the issue. 

Given that there is one satellite that is known to inflate the 
Milky Way's mass, it is interesting to investigate whether any of 
the other satellites, particularly the recent discoveries, play similar 
roles. The upper panel of Figure [5] shows the fractional contribu- 
tions each satellite makes to the Milky Way's mass (Cv\^j" l(GN)) 
- it is the total of these values that we take to be the mass esti- 
mate. There are two clear outliers; the outermost satellite in this 
distribution is Leo I, the less extreme satellite is Hercules. Like 
Leo I, Hercules has a substantial radial velocity and a relatively 
large Galactocentric distance (~ 130 kpc). Hercules has a highly 
elongated, irregu lar and flattened structure jBelokurov et al.ll2007l : 
IColeman et alj|2007) . This is consistent with tidal disruption dur- 



ing pericentric passages on a highly eccentric orbit (e > 0.9). This 
seems good evidence that Hercules is truly bound to the Milky Way. 

We repeat the same analysis for M3 1 and the results are shown 
in the bottom panel of Figure |5] Interestingly, we see that there are 
two outliers in the distribution, namely two of the recent discover- 
ies, And XII and And XIV. Notice that though both objects have a 
substantial effect on M3rs mass estimate, neither are as extreme 
as Leo I. It is the inclusion of these two new objects in the satel- 
lite dataset that has augmented the mass of M3 1 , so that it is now 
somewhat greater than that of the Milky Way. 

But, this begs the questi on: should these s atellit es be included? 
And XIV was discovered bv lMaiewski et alj ( l2007h in a survey of 
the outer M31 stellar halo. They recognized its extreme dynami- 
cal properties and suggested that it may either be falling into M3 1 
for the first time or that M3rs mass must be larger than hitherto 
estimated by virial arguments. In fact, And XIV's lack of gas and 
its elongated structure suggest that ram pressure stripping and tidal 
effects may have been important in its evolution. This is consistent 
with And XIV being a true satellite of M3 1 that has already suf- 
fered a pericentric passage, a conclusion that could be strengthened 
with deeper imaging, which might reveal the presence of tidal tails 
around And XIV. 

And XII is a still mor e ambiguous object - it was discovered 
as a stellar overdensity by Martin et al. (2006). Spectroscopic ob- 
servations were subsequently taken by Chapnian et al. (2007), who 
conjectured that the satellite might be falling into the Local Group 
for the first time. The evidence for this is its large velocity and its 
likely location behind M3I. However, it remains unclear whether 
this evolutionary track is consistent with the absence of detection 
of HI gas in the object. Pristine, infalling dwarfs, which have not 
yet experienced a pericentric passage of 50 kpc or less, should re- 
tain sizeable amounts of neutral HI gas, whereas IChapman et al.l 
(2007) constrain the mass in HI to be less than 3 x 10' Mg. 

In light of this, we provide more mass estimates, after remov- 
ing possible ambiguous objects (and re-computing the parameter y 
where necessary). For the Milky Way, we exclude Leo I only and 
then Leo I and Hercules. For M3I, we exclude And XII only and 
then And XII and And XIV. These mass estimates are also shown 
in Table [3] Note that, for example, the exclusion of Leo I does not 
change the mass estimate within 100 or 200 kpc, as Leo I is out- 
side of this range. Similarly, And XII and And XIV lie outside of 
100 kpc from the center of M31, so the mass estimates without 
them do not change the final column of the table. 
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Figure 6. Distribution of /? obtained from simultaneous mass and velocity anisotropy fitting. The median of each distribution is shown as a blue dashed line 
and the 68 % confidence limits as cyan dotted lines. These values are also given in blue and cyan on the plot. The left panel shows the idealised case for 500 
tracers, the right panel the case tailored to our M31 data where only 21 tracers are used. The value of /i used to generate the tracer population is also shown. 



In the case of velocity isotropy (y6 = 0), it requires the excision 
of both And XII and And XIV from the datasets for the mass es- 
timate of M3 1 to become comparable to or smaller than the Milky 
Way. For example, the mass of M31 with And XII and And XIV 
both removed is 0.85 ± 0.24 x IO'^Mq, as compared to the mass of 
the Milky Way with Leo I retained of 0.92 ± 0.25 x IO'-Mq. How- 
ever, we have argued that And XIV is most likely bound, whilst 
And XII is a more a mbiguous case. In other words, the problem 
pointed to by ISvans & Wilkinson ( 2000) - namely that the mass 
of M3 1 inferred from the kinematics of the satellites is less than 
the mass of the Milky Way - has indeed been ameliorated by the 
discovery of more fast-moving M3 1 satellites. 

It seems particularly intriguing that such satellites exist for 
both the Milky Way and M3 1 . Wilkinson & Evans ( 1999) used viri- 
alized models to estimate that the probability that, in a sample of 
30 satellites, there is an object like Leo I, which changes the mass 
estimate by a factor of a few. They found that the probability is 
minute, only ~ 0.5%. Prior expectation does not favour the exis- 
tence of objects like Leo I or And XII, yet in fact, both big galaxies 
in the Local Group possess such satellites. The clear conclusion 
is that the satellites in the outer parts of these galaxies cannot all 
be virialized. This is a point in favour of processes such as those 
advocated bv lSalesetai1 ( l2007l) to populate such orbits. 

4.3 Simultaneous Solution for Mass and Anisotropy 

There is one further way in which the estimators can be set to work 
with the line-of-sight velocities. When three dimensional positions 
and projected positions are simultaneously available - as for exam- 
ple in the case of M3rs satellites - it is possible to use the estima- 
tors based on both the (v^^^r") and the (v^^^R") moments to solve 
simultaneously for both the total mass and the anisotropy parame- 



Table 4. Table of proper motion data for the satellites of the Milky Way and 
M31. Listed are equatorial proper motions in mas century"' . 



Name 


/j„ cos (5 




Source 




(mas/century) 


(mas/century) 




Carina 


22 ±9 


15 ±9 


1 


Draco 


60 ±40 


110 ±30 


2 


Fornax 


48 ± 5 


-36 ±4 


3 


LMC/SMC 


198 ± 5 


25 ± 5 


4 


Sculptor 


9± 13 


2± 13 


5 


Sextans 


-26 ± 41 


10 ±44 


6 


Ursa Minor 


-50 ± 17 


22 ± 16 


7 


M33 


2.1 ±0.7 


2.5 ± 1.2 


8 


ICIO 


-0.2 ± 0.8 


2.0 ±0.8 


9 


M31 


2.1 ± 1.1 


-1.0 ±0.9 


10 



Sources: 1 - IPiatek et al.' bOOi), 2 - IScho lz & Irwin' /l994k 3 - 
[Piatek et al.l ^200^\) , 4 - Piatek et al.1 j2008h . 5 - Piatek et al. (200^, 6 - 
Walker et al. (2008), 7 - Piatek et al. 12005), 8 - Brunthaler et al. (20073), 
9 - Br unthaler et al.. a007b) , 10 - .van der Marel & Guhathakurta (^OoJ), 
though unlike the other proper motions, this not a measurement but inferred 
from indirect evidence. 

ter. There is however no guarantee that the solution for p is in the 
physical range -oo <p < 1. 

The success of this procedure of course rests on the accuracy 
of the data. The distances of the M31 satellites are determined by 
the tip of the red giant branch meth od and have errors of ±30 kpc 
(see e.g.. lMcConnachie et alj|200^ . If we use eqns l |23t and i26i , 
and simultaneously solve for the unknowns, we obtain 

M300 = 1.5 ± 0.4 X lO'^Mo, yS = -0.55!3:2 (33) 
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Figure 7. Distribution of mass estimates as a fraction of true mass for Monte 
Carlo simulations using (top) 23 satellites with radial velocities, (middle) 7 
satellites with proper motions and (bottom) 23 satellites, 7 of which have 
proper motions. The standard deviation of the best fitting Gaussian is shown 
in the top-right hand corner of each panel. [These plots assume p = -4.51, 
as estimated from the data] . 
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Figure 8. The fractional contribution each satellite with proper motions 
makes to the mean mass estimate for the Milky Way Galaxy. Notice the 
extreme effect of Draco's proper motion. 



which corresponds to mild tangential anisotropy. These are surpris- 
ingly sensible answers given the distance errors. 

Fig. [6] is inferred from Monte Carlo simulations and shows 
the distributions of anisotropy parameters derived from simultane- 
ous mass and anisotropy fitting for mock datasets. Also given in 
the panels are the median and 68 per cent confidence limits for the 
anisotropy parameter, in the case of 21 satellite galaxies (compara- 
ble to the present dataset for M31) and the case of 500 satellites. 
Although with 21 tracers, the errors on the anisotropy parameter 
are substantial, matters improve significantly with larger numbers 
of tracers. A dataset of 500 halo satellites (dwarf galaxies, globular 
clusters and planetary nebulae) is not unreasonable for a galaxy like 
M31 in the near future. This raises the possibility that the method 
of simultaneous fitting may prove more compelling in the future. In 
fact, given 500 tracers, it is reasonable to use the estimators based 
on both the {v^^/") and the (v^^^R") moments to fit simultaneously 
at each distance, thus giving the run of anisotropy parameter and 
mass with radius. 



4.4 Radial and Proper Motion Datasets 

Thus far, we have used only the line of sight velocities to make mass 
estimates. In this section, we add in the proper motions of satellites, 
where available. Thus, for the Milky Way galaxy, we combine re- 
sults from eqn l |16l ) for satellites without proper motions and from 
eqn i24i for those with proper motions, weighting each estimate by 
the reciprocal of the standard deviation to give the final answer. 

Proper motions, albeit with large error bars, have been mea- 
sured for a total of 9 of the Milky Way satellite galaxies. It seems 
prudent to exclude Sagittarius, as it is in the process of merg- 
ing with the Milky Way. Additionally, the interacting Magellanic 
Clouds are treated as a single system by computing the proper mo- 
tion of their mass centroid, taking the masses of the LMC and SMC 
as ~ 2 X IO'^Mq and 2 x 10' Mq respectively (Kroupa & Bastia^ 
Il997h . This leaves us with a set of 7 satellites with proper mo- 
tion data, summarized in Table O In most cases, errors on proper 
motions are large and, where multiple studies exist, the mea- 
surements are sometimes in disagreement. The proper motions 
inferred by ground based methods are in reasonable agreement 
with those derived fr om the Hubble Space Telescope (HS T^ in 
the cases of Fornax /Piatek et al.ll2007'; 'Walk er et al.ll2008 >). Ca- 
rina (Piateketal. 2003; Walker etal. 2008) and the Magellanic 
Clouds (Piateketal. 2008; Costa et al. 2009). But, for Ursa Mi- 
nor (Scholz&Irwin 1994; Piateketa l. 2005) and for Sculptor 
jPiatek et alj|2006l ; l\\^lker et al.ll2008l) . agreement between diflter- 
ent investigators is not good, and we have preferred to use the es- 
timates derived from HST data. Nonetheless, it is important to in- 
clude the proper motion data, especially for mass estimates of the 
Milky Way Galaxy. We use these proper motions along with dis- 
tance and line-of-sight velocity data t o calculate full spac e veloci- 
ties for these satellites, as described in IPiatek et al.U2002l) . 

In addition, there are two satellites of M3I with measured 
proper motions, namely M33 and IC 10. This astonishing feat has 
exploited the Very Long Baseline Array to measure the positions 
of water masers relative to bac kground quasars at multiple epochs 
( iBrunthaler et alj|2005l [2007bl) . Unfortunately, the technique can- 
not be extended to M3I itself, as it does not contain any water 
masers, a nd so its proper motion is much less securely known. 
However, Ivan der Marel & Guhathakurtal ( |2008|) reviewed the ev- 
idence from a number of sources - including kinematics of the 
M31 satellites, the motions of the satellites at the edge of the Lo- 
cal Group, and the constraints imposed by the tidal distortion of 
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M33's disk - to provide a best estimate. Tliese data are also listed 
in Tableg] 

The Milky Way satellites are so remote that their line-of-sight 
velocities in the Galactic rest frame are almost identical to their 
radial velocities, as judged form the Galactic Centre. The proper 
motion data provide genuinely new information on the tangential 
motions and this is the only way to break the mass-anisotropy de- 
generacy. The same argument does not hold with equal force for 
M31, as the line of sight velocities incorporate contributions from 
both the radial and tangential components as reckoned from the 
centre of M31. Nonetheless, it is good practice to use all the data 
available, even though the proper motions of M33 and IC 10 with 
respect to the M3 1 reference frame must be inferred using an esti- 
mate of M3rs proper motion (rather than a measurement). 

For the satellites without proper motions, we use the form of 
the estimator given in eqns l |16l l or l |23t for the Milky Way and 
M31 respectively; for those with proper motions, we use eqn J24t . 
We combine results from the two estimators, weighting each esti- 
mate by the reciprocal of the standard deviation to give the final 
answer To infer the standard deviation, we perform Monte Carlo 
simulations. So, for the case of the Milky Way, we generate mock 
datasets of 25 satellites, for which only 7 have proper motions. The 
errors on radial velocities are dwarfed by the uncertainty caused by 
the small number statistics and so are neglected. But, the errors on 
the proper motions are not negligible and they are incorporated into 
the simulations by adding a value selected at random from the range 
[-0.5 fi, 0.5 yu], where yu is the proper motion. The flat distribution 
has been chosen as systematic errors are as important as random 
Gaussian error in the determination of proper motions. However, 
we have tested alternatives in which we use the relative observa- 
tional errors, or the relative observational errors multiplied by 2.5, 
and find that our results are robust against changes to the error law. 
The standard deviations of the fractional mass distribution the satel- 
lites with and without proper motions are separately computed, as 
illustrated in the panels of Figure|7] We linearly combine the mass 
estimates, weighting with the reciprocal of the standard deviation, 
to give the final values reported in Table [3] 

Given that the Milky Way satellites with measured proper mo- 
tions are moving on polar orbits, it is no surprise that the mass esti- 
mate of the Milky Way has now increased. Adopting the value of /? 
we estimate from the data, we find M300 = 3.9 ± 0.7 x IO'^Mq for 
the Milky Way Galaxy and M300 = 1.5 + 0.4X IO'^Mq for M31. As- 
suming isotropy, we find M300 = 2.5 ± 0.5 x IO'-Mq for the Milky 
Way Galaxy and M300 = 1.4 ± 0.4 x IO'^Mq for M31. Notice how- 
ever, the mass estimate for M3 1 has barely changed from the value 
inferred from the full radial velocity dataset. 

Again, we calculate the contribution that each satellite makes 
to the mass estimate to investigate whether any are dominating the 
final answer. First, this procedure guards against the possibility of 
a completely rogue proper motion measurement. Second, there are 
some suggestions that the Magellanic Clouds may not be bound, or 
even if bound may only be on its second passage and so may not 
be pa rt of the relaxed distribution of satellite galaxies jBesla et all 
l2007h . So, it is helpful to check that our results are not unduly sen- 
sitive to its inclusion. As Figure [8] shows, we find that Draco is a 
clear outlier and nearly doubles the Milky Way mass estimate. If we 
remove the Draco proper motion from the sample, we instead re- 
cover a mass M300 = 2.7 ± 0.5 x IO'^Mq (assuming ySjata) or M300 = 
1.4 ± 0.3 xlO'^Mo (assuming isotropy). It is particularly concern- 
ing that the proper motion of Draco has such a substantial effect, 
because - as judged from the size of the error bars in Table|4]- it is 
one of the noisier measurements. By contrast, the exclusion of the 



Magellanic Clouds has only a minor effect, as is evident from the 
results listed in Table[3] 

We have covered a number of possibilities, so it is probably 
useful for us to give our best estimates. On balance, we think the 
case for including at least And XIV among the satellite sample for 
Andromeda is strong. Whilst And XII is a more ambiguous case, 
the lack of any HI gas suggests to us that it should also be included. 
Among the satellites o f the Milky Way, w e favour including Leo 
I based on the work of ISohn et alj j2007l) . whilst we are inclined 
to disc ard the proper motion of Draco reported in lScholz & IrwinI 
until corroborated. Until the discrepancy between the veloc- 
ity anisotropics reported in simulations and in data is explained, we 
prefer to use the data as our guide 

So, our best estimate for the mass of the Milky Way within 
300 kpc is 

M300 ~ 2.7 ± 0.5 X lO'-Mo (34) 
whilst for M31, it is 

M300 ~ 1.5±0.4x lO'-Mo. (35) 

These estimates are obtained using the combined radial velocity 
and proper motion datasets. The error bars only incorporate the sta- 
tistical uncertainty. As we have emphasised, there are much greater 
uncertainties induced by selection of satellite members and velocity 
anisotropy. In particular, when these uncertainties are considered, 
it is not possible to decide which of the Milky Way or M31 is more 
massive based on satellite kinematic data alone. 



5 DISCUSSION 

It is instructive to compare our results with a number of recent es- 
tim ates of t he masses of the Local Group and its component galax- 
ies. [Xue^eTal. (2008) extracted a sample of ~ 2400 blue horizontal 
branch stars from the SDSS. These are all resident in the inner halo 
within 60 kpc of the Galactic centre. This has the advantage that the 
BHBs are surely virialized, but the disadvantage that no inference 
can be made about the mass exterior to 60 kpc. Hence, any estimate 
as to the total mass is driven wholly by prior assumptions rather 
than the data. In fact. lXue et"al] ( I2OO8I) assumed an NFW halo with 
a canonical concentration holds, and then estimated the virial mass 
of the Milky Way's dark matter halo as M = LO+55i x IO'^Mq, 
using Jeans modelling with an anisotropy parameter inferred from 
numerical simulations. This is lower than our preferred value, but 
in good agreement with our comparable calculations using line of 
sight velocity datasets alone. 

A somewhat s imilar calculation for M3 1 has been reported by 
ISeigar et al.l ( |2008|) . The mass of the baryonic material is estimated 
using a Spitzer 3.6 jim image of the galaxy, together with a mass- 
to-light ratio gradient based on the galaxy's B — R colour This is 
combined with an adiabatically-contracted NFW halo profile to re- 
produce the observed HI rotation curve data. They find a total virial 
mass of M3rs dark halo as 8.2 ± 0.2 x 10" Mq. This is lower than 
most of our estimates, with the exception of those based on samples 
excluding both And XII and And XIV. 

Although these calculations are interesting, it is worth remark- 
ing that the final masses are not wholly controlled by the data. We 
know that, from Newton's theorem, any mass distribution outside 
the limiting radius of our data has no observational effect in a spher- 
ical or elliptical system. To estimate the virial mass from data con- 
fined to the inner parts (such as BHBs or the optical disk) requires 
an understanding of the structure of the pristine dark halo initially. 



© 0000 RAS, MNRAS 000, 000-000 



The Masses of the Milky Way and Andromeda galaxies 13 



as well as how it responds to the formation of the luminous bary- 
onic components. It is th is that controls the final answer. 

Li & Whiti ( l2008h used the Millennium Simulation to ex- 
tract mock analogues of the Local Group and calibrate the bias 
and error distribution of the Timing Argument estimators (se e 
e.g., lKahn&Woltie3ll959l : iRavchaudhurv & Lvnden-Belllll989h . 
From this, they obtain a total mass of the two large galaxies in 
the Local Group of 5.3 x IO'^Mq with an inter-quartile range of 
[3.8 X 10'^ 6.8 X 10'^] Mb and a 95 % cori fidence lower limit of 
1.8x IO'^Mq. Importantlv. lLi & Whitj ( l2008h showed that the mass 
estimate from the timing argument is both unbiased and reasonably 
robust. This is a considerable advance, as there have long been wor- 
ries that the gross simplification of two-body dynamics implicit in 
the original formulation of the Timing Argument may undermine 
its conclusions. 

It therefore seems reasonable to assume that the combined 
mass of the Milky Way Galaxy and M31 is at least 3.8 x lO'^Mo, 
and perhaps more like 5.3 x l O'-Mp. The low esti mates of the 
Milky Way and M31 masses of IXue et all ( l2008h and lSeigar et al.l 
( I2OO8 ') are not compatible with this, and barely compatible with Li 
& White's 95 % lower limit. Using our preferred values in eqns ([34} 
and ( I35t . the combined mass in the Milky Way and M31 galaxies 
is 4.2 ± 0.6 X 10'^Mq. This is comparable to the 3.8 x IO'-Mq of Li 
& White. 

iLi & Whit3 j2008h also estimated a virial mass for the Milky 
Way of 2.4 x IO'^Mq with a range of [1.1 x 10'', 3.1 x 10'^] Mq, 
based on timing arguments for Leo I. Given all the uncertainties, 
this is in remarkable accord with our best estimate. 



6 CONCLUSIONS 

We have derived a set of robust tracer mass estimators, and dis- 
cussed the conditions under which they converge. Given the po- 
sitions and velocities of a set of tracers - such as globular clus- 
ters, dwarf galaxies or stars ~ the estimators compute the enclosed 
mass within the outermost datapoints. The accuracy of the esti- 
mator has been quantified with Monte Carlo simulations. The es- 
timators are applicable to a wide range of problems in contem- 
porary astrophysics, including measuring the masses of elliptical 
galaxies, the haloes of spiral galaxies and galaxy clusters from 
tracer populations. They are considerably simpler to use than dis- 
tribution function based methods (see e.g. Little & Tremainel 19871 : 
iKulessa & Lvnden-Belllll992l : IWilkinson & Evanslll999l) , and hi- 
volve no more calculation than taking weighted averages of com- 
binations of the positional and kinematical data. They should find 
widespread applications. 

The mass estimators are applied to the satellite populations of 
the Milky Way and M3 1 to find the masses of both galaxies within 
300 kpc. These estimates are the first to make use of the recent burst 
of satellite discoveries around both galaxies. Both satellite popu- 
lations have nearly doubled in size since previous estimates were 
made. We summarise our results by answering the questions; What 
are (1) the minimum, (2) the maximum and (3) the most likely 
masses of the Milky Way and M31 galaxies? 



(1) The mass of the Milky Way Galaxy within 300 kpc could be as 
low as 0.4 ± 0.1 X 10''Mq. This would imply that Leo I is grav- 
ita tionally unbound, contrary to the recent evidence provided by 
bv lSohnetai] ( l2007h . Leo I would then be either an interloper or an 
object being ejected from the Milky Way by an encounter. It would 
also require that the proper motion of Draco jScholz & IrwinI 19941) 
is incorrect, which is not inconceivable given the difficulty of the 



measurements. It implies that the satellite galaxies are moving on 
radial orbits and so the velocity anisotropy is radial. 

The mass of M31 within 300 kpc could plausibly be as low 
as 0.8 ± 0.2 X IO'-Mq. This would be the case if both And XII and 
And XIV are not gravitationally bound, which is po ssible if mecha- 
nisms such as those proposed bv lSalesetal]j2007l) are ubiquitous. 
It would also require that the proper motion data on M33 and ICIO 
or - perhaps more likely - the indirectly inferred proper motion of 
M3 1 is in error. Again, such a low estimate for the mass occurs only 
if the satellites are moving predominantly radially. 

Although it is interesting to ask how low the masses of the 
Milky Way and M31 could be, it does produce a mystery in the 
context of the Timing Argument, which typically yields larger 
combined masses. It is possible that some of the mass of the Lo- 
cal Group is unassociated with the large galaxies. Although not 
the conventional picture, this is probably not ruled out and there 
have been suggestions that ~ 10'"Mo may be present in the Lo- 
cal Grou p in the form of bar yons in the warm-hot intergalactic 
medium dNicastro et alj[2003h . There are few constraints on the 
possible existence of dark matter smeared out through the Local 
Group, and unassociated with the large galaxies. However, the clus- 
tering of the dwarf galaxies around the Milky Way and M3 1 does 
suggest that the gravity of the dark matter is centered on the promi- 
nent galaxies. 

(2) The largest mass estimate we obtained for the Milky Way 
Galaxy is 3.9 ± 0.7 x IO'^Mq. This extreme values is driven by the 
assumption of tangential anisotropy for the satellites, so that the 
measured line of sight velocities also imply substantial tangential 
motions as well. The estimate assumes all the satellites including 
Leo I to be bound, and the anomalously high proper motion mea- 
surement of Draco to be valid. 

Note that the present data allow considerably more scope to 
increase the mass of the Milky Way Galaxy than M3 1 . Our largest 
mass estimate for M3 1 is a much more modest 1.6 ± 0.4 x IO'^Mq, 
which occurs when we analyse the whole sample incorporating 
And XII and And XIV and assume tangentially anisotropic velocity 
distributions. 

The current concensus is that the two galaxies are of a roughly 
similar mass, with M31 probably the slightly more massive of the 
two. This though is inferred from indirect properties, such as the 
numbers of globular clusters, which correlates with total mass al- 
beit with scatter, or the amplitude of the inner gas rotation curve. 
The stellar halo of M3 1 is certainly more massive than that of the 
Milky Way, although this may not be a good guide to the dark halo. 
Of course, it could be that the current concensus is wrong, and that 
the Milky Way halo is more massive than that of Andromeda. There 
is also some indirect evidence in favour of this - for example, the 
typical sizes of the M3 1 dwarf spheroidals are large than those of 
the Milky Way, which is explicable if the Milky Way halo is denser. 
However, it does not seem reasonable to postulate that the mass of 
the Milky Way is substantially larger than that of M3 1 . Hence, the 
very large estimate of 3.9 ± 0.7 x IO'^Mq is best understood as a 
manifestation of the degeneracy in the problem of mass estimation 
with only primarily radial velocity data. 

(3) Our preferred estimates come from accepting Leo I, And XII 
and And XIV as bound satellites, whilst discarding the Draco 
proper motion as inaccurate. This gives an estimate for the mass of 
the Milky Way within 300 kpc as 2.7 ± 0.5 x IO'^Mq and for M31 
as 1.5 ± 0.4 X IO'^Mq, assuming the anisotropy implied by the data 
(J3 X -4.5). The error bars are just the statistical uncertainty and do 
not incorporate the uncertainty in anisotropy or sample member- 
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ship. In view of this, it is not possible to decide which of the Milky 
Way galaxy or M31 is the more massive based on the kinematic 
data alone. 

These values for the masses are attractive for a number of 
reasons. First, the mass ratio between the Milky Way and M3 1 is 
of roughly of order unity, which accords with a number of other 
lines of evidence. Second, the values allow most of the dark matter 
in the Local Group implied by the Timing Argument to be clus- 
tered around the two most luminous galaxies. Third, they are within 
the range foun d for cosmological ly motivated models of the Milky 
Way and M3 1 jLi & Whitel20o3) . 

We prefer to assume the anisotropy implied by the admittedly 
scanty data on the proper motions of the satellites. However, for 
completeness, we quickly sketch the effects of dropping this as- 
sumption. If the velocity distribution is isotropic, or even radially 
anisotropic as suggested by the simulations, then the mass of the 
Milky Way becomes 1.4 ± 0.3 x IO'-Mq or 1.2 ± 0.3 x IO'-Mq re- 
spectively. Similarly for M31, the values are 1.4 ± 0.4 x IO'-Mq 
(isotropy) or 1.3 ± 0.4 x IO'^Mq (radially anisotropic). 

The greatest sources of uncertainty on the masses remain the role of 
possibly anomalous satellites like Leo I and the velocity anisotropy 
of the satellite populations. There is reason to be optimistic, as 
the Gaia satellite will provide proper motion data on all the dwarf 
galaxies that surround the Milky Way and M31, as well as many 
hundreds of thousands of halo stars. The analysis that we have car- 
ried out here indicates that proper motions are important if we wish 
to increase the accuracy of our estimates, as well as understand the 
dynamical nature of objects like Leo I. While we are not yet able 
to exploit the proper motions, Gaia will allow us to do so. 
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